NSF PAR Search | NSF Public Access Repository

Orthogonal Gated Recurrent Unit With Neumann-Cayley Transformation

https://doi.org/10.1162/neco_a_01710

Zadorozhnyy, Vasily; Mucllari, Edison; Pospisil, Cole; Nguyen, Duc; Ye, Qiang (November 2024, Neural Computation)

In recent years, using orthogonal matrices has been shown to be a promising approach to improving the training, stability, and convergence of recurrent neural networks (RNNs), particularly by controlling gradients. While gated recurrent unit (GRU) and long short-term memory (LSTM) architectures address the vanishing gradient problem by using a variety of gates and memory cells, they are still prone to the exploding gradient problem. In this work, we analyze the gradients in GRU and propose the use of orthogonal matrices to prevent exploding gradients and enhance long-term memory. We study where to use orthogonal matrices and propose a Neumann series–based scaled Cayley transformation for training orthogonal matrices in GRU, which we call Neumann-Cayley orthogonal GRU (NC-GRU). We present detailed experiments with our model on several synthetic and real-world tasks, which show that NC-GRU significantly outperforms GRU and several other RNNs.
Full Text Available
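
The NC-GRU abstract above names a Neumann series–based scaled Cayley transformation but does not state its formulas. The following is a minimal sketch under stated assumptions, not the authors' implementation: it assumes the scaled Cayley form W = (I + A)^{-1}(I - A)D known from earlier scaled Cayley orthogonal RNN work, with A skew-symmetric and D diagonal with entries ±1, and it assumes the inverse is replaced by a truncated Neumann series, (I + A)^{-1} ≈ I - A + A^2 - ... The helper names, the example matrix, and the truncation orders are all hypothetical.

```python
# Sketch only: the formulas here are assumptions, not taken from the abstract.
# Pure Python (no third-party libraries) to keep the example self-contained.

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def matmul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def add(X, Y, sign=1.0):
    """Return X + sign * Y, entry by entry."""
    return [[x + sign * y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def neumann_inverse(A, order):
    """Approximate (I + A)^{-1} by the truncated series sum_{k=0}^{order} (-A)^k.

    The series converges only when the spectral radius of A is below 1.
    """
    n = len(A)
    neg_A = [[-a for a in row] for row in A]
    term = identity(n)   # current power (-A)^k, starting at k = 0
    total = identity(n)  # running partial sum
    for _ in range(order):
        term = matmul(term, neg_A)
        total = add(total, term)
    return total

def neumann_cayley(A, d, order=12):
    """Approximate W = (I + A)^{-1} (I - A) D, with D = diag(d), d[j] in {+1, -1}."""
    n = len(A)
    W = matmul(neumann_inverse(A, order), add(identity(n), A, sign=-1.0))
    # Right-multiplying by the diagonal matrix D scales column j by d[j].
    return [[W[i][j] * d[j] for j in range(n)] for i in range(n)]

def orthogonality_error(W):
    """Largest absolute entry of W^T W - I (zero for an exactly orthogonal W)."""
    n = len(W)
    Wt = [list(col) for col in zip(*W)]
    G = matmul(Wt, W)
    return max(abs(G[i][j] - (1.0 if i == j else 0.0))
               for i in range(n) for j in range(n))

# Hypothetical 3x3 skew-symmetric parameter (A^T = -A) with small entries.
A = [[0.0, 0.1, -0.2],
     [-0.1, 0.0, 0.05],
     [0.2, -0.05, 0.0]]
d = [1.0, -1.0, 1.0]  # hypothetical diagonal of D

W = neumann_cayley(A, d, order=12)
err_low = orthogonality_error(neumann_cayley(A, d, order=2))
err_high = orthogonality_error(W)
print("orthogonality error, order 2: ", err_low)
print("orthogonality error, order 12:", err_high)
```

In this sketch the truncated series avoids an explicit matrix inverse, and W is only approximately orthogonal, with the error shrinking as more terms are kept. How the published method chooses the truncation order and handles gradients is not described in the abstract.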
Modeling imaged welding process dynamic behaviors using Generative Adversarial Network (GAN) for a new foundation to monitor weld penetration using deep learning

https://doi.org/10.1016/j.jmapro.2024.05.081

Mucllari, Edison; Cao, Yue; Ye, Qiang; Zhang, YuMing (June 2024, Journal of Manufacturing Processes)

Full Text Available
Do We Need a New Foundation to Use Deep Learning to Monitor Weld Penetration?

https://doi.org/10.1109/LRA.2023.3270038

Mucllari, Edison; Yu, Rui; Cao, Yue; Ye, Qiang; Zhang, YuMing (June 2023, IEEE Robotics and Automation Letters)

Full Text Available
Novel Molecular Representations Using Neumann-Cayley Orthogonal Gated Recurrent Unit

https://doi.org/10.1021/acs.jcim.2c01526

Mucllari, Edison; Zadorozhnyy, Vasily; Ye, Qiang; Nguyen, Duc Duy (May 2023, Journal of Chemical Information and Modeling)

Full Text Available
